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ABSTRACT 

One of the main science goal of the future European Extremely Large Telescope will be to 
O ■ understand the mass assembly process in galaxies as a function of cosmic time. To this aim, 

a multi-object, AO-assisted integral field spectrograph will be required to map the physical 
^ . and chemical properties of very distant galaxies. In this paper, we examine the ability of such 

, an instrument to obtain spatially resolved spectroscopy of a large sample of massive (0.1 < 

M stellar < 5 X lO^^M©) galaxics at 2 < z < 6, selected from future large area optical-near IR 
surveys. We produced a set of about one thousand numerical simulations of 3D observations 
^ ' using reasonable assumptions about the site, telescope, and instrument, and about the physics 

! of distant galaxies. These data-cubes were analysed as real data to produce realistic kinematic 

measurements of very distant galaxies. We then studied how sensible the scientific goals are 
I to the observational (i.e., site-, telescope-, and instrument-related) and physical (i.e., galaxy- 

related) parameters. We specifically investigated the impact of AO performance on the science 
goal. We did not identify any breaking points with respect to the parameters (e.g., the telescope 
diameter), with the exception of the telescope thermal background, which strongly limits the 
. performance in the highest (z>5) redshift bin. We find that a survey of Ngai galaxies that fulfil 

■ the range of science goals can be achieved with a ~90 nights program on the E-ELT, provided 

a multiplex capability M ~ Ngai/^. 

■ Key words: Galaxies: evolution - Galaxies: high-redshift - Galaxies: kinematics and dynam- 
^1 ics - Instrumentation: adaptive optics - Instrumentation: high angular resolution - Instrumen- 
tation: spectrographs 



o 



1 INTRODUCTION 

Over the last decade, the synergy of 8-10 meter class telescopes 
with HST has strongly re-invigorated the field of galaxy forma- 
tion and evolutio n by un veiling^e ry distant g alaxies u p t o z- 6 



(e.g., ICubv etaP 120031 : iRhoads et all l2003l : iBremer et all l2004l : 



iBouwens et al.l 200 ^, by allowing the first determinati on of the 



global star formation history since redshift z~6 (e.g., iHopkinsI 
I2OO4V and by providing the first insights on the stel lar mass as- 
semb l y history out to z-5 (e.^., Drorv et al. 2005: Pozzetti et al.l 
l2007l : fMarchesini et aLll2007l : IPerez-Gonzalez et aLll2008h . Despite 
these recent progresses, the outstanding question remains on how 
and when galaxies assembled their baryonic mass across cosmic 
time. The CDM standard model has provided a satisfactory sce- 
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nario describing the hierarchical assembly of dark matter halos, in a 
bottom-up sequence which is now well-established over the whole 
mass structure spectrum. In contrast, little progress has been made 
in the physical understanding of the formation and evolution of the 
baryonic component because the conversion of baryons into stars 
is a complex, poorly understood process. 

As a result, all intellectual advances in galaxy formation and 
evolution over the last decade have been essentially empirical, of- 
ten based on phenomenological (or semi-analytical) models, which 
heavily rely on observations to describe, with simplistic rules, such 
processes as star formation efficiency, energy feedback from star 
formation and AGN, chemical evolution, angular momentum trans- 
fer in merging, etc. Cornerstones observations in this empirical 
framework are the total and stellar mass of galaxies and their phys- 
ical properties, including the age and metallicities of their under- 
lying stellar populations, dust extinction, star formation rate, and 
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structural/morphological parameters. The study of well-estabUshed 
scahng relations involving a number of these physical parame- 
ters (e.g. mass-metallicity, fundamental plane, colour-magnitude, 
morphology-density) are essential for understanding the physical 
processes driving galaxy evolution. However, with the current gen- 
eration of lOm-class telescopes, we have been able to construct 
for example the fundamental plane of early-type galaxies, or to 
measure the Tully-Fisher relation of late-type galaxies over a wide 
range of masses only at low and intermediate redshifts (z<l), 
whereas only the brightest or most massive galaxies have been ac- 
cessible at z > 1, and a direct measurement of masses has been al- 
most completely out of reach at z > 2. Thus, our ability to explore 
the evolution and origin of the aforementioned scaling relations has 
rapidly reached the limit of lOm-class telescopes. 

Hence, most of the outstanding questions arisen from recent 
observational galaxy evolution studies which have pushed the 10m- 
class telescopes to their limits call for an Extremely Large Tele- 
scope (ELT), specifically to extend the spectroscopic limit by at 
least two magnitudes with near-diffraction limit angular resolution. 
IFU spectrographs on 8- 10m class telescopes (e.g., VLT/SINFONI 
and Keck/Osiris) are now routinely deriving the spatially-resolved 
kinematics of massive (i.e., with stellar masses larger than 



10 "Mq), distant galaxies, from z-0.4 to z-3 ([Flores et al 
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H;|l emoine-Busserolle et al ]l200i. Such studies have brought 
new and essential insights into galaxy evolution processes, such as 
the fraction of rotators as a f unction of redshift ( Yang et aLll2008l : 
iForster Schreiber et al.l 12009*), a b etter unde rstanding of t he evo- 
lution of the Tully-Fishe r relation JPuech et al. 2008 : Cresci et al.l 
I2OO9I : IPuech etal ll2009h . or the first glimp s e into the evolutio n 
of the angular momentum ("Puec h et al.ll2007l : lBouche et al.ll2007h . 
However, several uncertainties and limitations remain, especially 
at the highest redshifts (i.e., z>l). For instance, z~2 surveys were 
drawn from diff'erent selection criteria (e.g., BX, BM, or BzK 
galaxies), and this might bias the resulting sample in a subtle and 
quite uncontrolled way. Besides, samples in the NIR are selected 
in optimal atmospheric windows, free of OH sky lines and max- 
imising the atmospheric throughput, which intrinsically limits the 
redshift range and size of the resulting sample. The only way to 
overcome these issues is to move to telescopes with larger collect- 
ing areas. 

ESO is currently developi ng a Phase B study for a 
European 42-nieter telescope doilmozzi & Spvromiliol I2OO8I : 
ISpvromilioeTaDl2008h . As part of the project development, the 
Design Reference Mission (DRM) aims at producing a set of ob- 
serving proposals and corresponding simulated data which together 
provide the project with a tool to assess the extent to which the E- 
ELT addresses key scientific questions and assist in critical trade- 
off" decision^ In the frame of the DRM eff'ort, simulations of 
3D high-z galaxy observations were undertaken. These observa- 
tions are expected to yield direct kinematics of stars and gas in 
the first generation of massive galaxies (in the range 0.1 < M < 
5xlO^^Mo), as well as their stellar population properties. This will 
allow one to derive dynamical masses, ages, metallicities, star- 
formation rates, dust extinction maps, to investigate the presence 
of disk and spheroidal components and the importance of dynami- 
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cal processes (e.g. merging, in/outflows) which govern galaxy evo- 
lution. These data will also allow one to study the onset of well 
known scaling relations at lower redshifts, and to witness the grad- 
ual shift of star formation from the most massive galaxies in the 
highest density regions to less massive galaxies in the field. 



To assess the science achievement of an NIR IFS on the E- 
ELT, as well as to better understand the interactions between the 
telescope, site, and instrument, we have produced an extensive set 
of ~1000 simulations of observations of very distant galaxies. It is 
important to realize that if one wants to study what kind of instru- 
ment is needed to understand the galaxy mass assembly process, 
then one needs to explore a huge parameter space, i.e., from re- 
laxed smooth or clumpy rotating disks to major and minor mergers, 
taking care of spanning all possible mass ratios, viewing angles, 
merger geometry and so on, which would make such a direct ap- 
proach very difficult, and probably even impossible, on the practical 
side. Moreover, one has to recognise that our knowledge and under- 
standing of the physics of distant galaxies is still incomplete, which 
necessitates to extrapolate some of their characteristics. Therefore, 
the simulations presented in this paper do not intend to be exhaus- 
tive in any sense. Rather, we aim at exploring this very large pa- 
rameter space using reasonable assumptions and guesses. Indeed, 
the DRM exercise consisted in a broad exploration of the parameter 
space in realistic observing conditions, in order to identify possible 
limitations and/or breaking points, as well as interactions between 
the telescope, site, and instrument, which could potentially impact 
the telescope design. 



This paper is the second of a series that explores the perfor- 
mances of a NIR IFS on the E-ELT. In the first paper, we presented 
our methodology to smiulate realistic observations of distant galax- 
ies ( Puech et al.l l2008bl) . We also produced first simulations and 
explored performance using a few scientifically-motivated cases. 
They illustrated the concept of "scale-coupling", i.e., the relation- 
ship between the IFU pixel scale and the size of the kinematic 
features that need to be recovered by 3D spectroscopy in order 
to understand the nature of the galaxy and its substructure. In 
iPuech et al ](|2008b|) , we focused on the largest spatial scales, which 
are of particular interest because they carry most of the kinematic 
information useful to reveal the process underlying galaxy dynam- 
ics, i.e., whether a given galaxy is in a coherent and stable dynami- 
cal state (e.g., rotation), or, on the contrary, out of equilibrium (e.g., 
subsequently to a merger). 



In this paper, we present an updated version of the simula- 
tion pipeline. The main improvement is related to the inclusion of 
thermal emissions from both the IFS and the telescope. This al- 
lowed us to explore a wider parameter space, and especially areas 
where observations are limited by the thermal emission rather than 
by the sky background (e.g., relatively faint targets in the K band). 
We also significantly extended the range of morpho-kinematic tem- 
plates and of physical parameters (e.g., radius, mass, and velocity 
as a function of redshift) considered in the simulations. This paper 
is organised as follows. In Sect. 2, we present our methodology and 
the pipeline used for simulations. In Sect. 3, we detail the scientific 
and observational inputs used in the simulations. In Sect. 4, we 
present the results of the simulations, which are discussed in Sect. 
5. A conclusion is drawn in Sect 6. Throughout, we adopt Hq = 70 
km/s/Mpc, Qm = 0.3, and Qa = 0.7, and the AB magnitude system. 
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2 THE PHYSICS AND MASS ASSEMBLY OF GALAXIES 
OUT TO Z~6 WITH THE E-ELT 

2.1 Goals of the simulations 

Simulations presented in this paper focus on the sub- sample of dis- 
tant emission line galaxies: due to Signal-to-Noise Ratio (SNR) 
limitations, absorption line galaxies at z > 1.5 will more likely 
be studied using integrated spectroscopy (see DRM Science Case 
CI 0-3 "ELT integrated spectroscopy of early-type galaxies at z > 
r0). Therefore, in the following, only emission line galaxies will 
be considered and for convenience, we will sometimes use the sin- 
gle word "galaxies" to refer to "emission line galaxies". 

Spatially-resolved kinematics of such galaxies provides a use- 
ful test-bed for 3D spectrographs, because it drives the stringent 
requirements on the SNR: while flux is the zero-order moment of 
an emission line, the velocity and velocity dispersion are derived 
from the position and the width of emission lines, which are their 
first and second moments, and higher order moments are always 
characterised by larger statistical uncertainties. Hence, simulations 
shall assess the principal scientific goal of spatially-resolved spec- 
troscopy of distant emission line galaxies, focusing on kinematics. 
Diff'erent objectives can be defined depending on the level of ac- 
curacy and/or spatial scale one wants to probe; we defined the fol- 
lowing objectives (dubbed here as "steps"), ordered by increasing 
complexity/difficulty : 

(i) Simple (3D) detection of emission line galaxies: what stellar 
mass can be reached at a given SNR as a function of redshift, AO 
system, environmental conditions, etc.? 

(ii) Recovery of large scale motions (see, e.g., IFlores et al.l 
120061) : what are the conditions under which it is possible to recover 
the dynamical state of distant galaxies (e.g., relaxed rotating disks 
vs. non-relaxed major mergers)? 

(iii) Recovery of Rotation Curves (RC): what are the condi- 
tions under which it is possible to recover the rotation veloc- 
ity Vror (to d erive, dy namical masses or the Tully-Fisher rela- 
tion, see, e.g. JPuech et al. 2006a, 2008) or the whole shape of the 
RC (for derivation a nd decomposition in mass profiles, see, e.g., 
iBlais-Ouellette et aLl[2001i) ? 

(iv) Recovery of the detailed kinematics: what are the condi- 
tions under which it is possible to detect internal structure s in dis- 
tant g alaxies, like clumps in distant disks (see, e.g.. ,Bournaud et al.! 
l2008h ? 

Studying and evaluating in detail the future achievement and 
measurement accuracy of an NIR integral-field spectrograph on the 
E-ELT in these areas is beyond the scope of this paper. Moreover, 
in addition to the scientific limitations pointed out in the Introduc- 
tion, one has to take into account the fact that technical develop- 
ments (both on the telescope and instrument side) are not advanced 
enough to allow us to conduct such a detailed study, since several 
key elements are currently not fully known (e.g., the thermal contri- 
bution from the telescope, coatings, instrument design). Rather, the 
goal of this paper is to restrict the acceptable underlying parame- 
ter space (see below), and identify breaking points on the technical 
side that might strongly impact the achievable science of an IFS an 
the E-ELT 

However, to put this study on as realistic as possible grounds, 
an observational proposal was defined prior to simulations. This 
program was designed to provide us with an ultimate test of galaxy 
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formation theories. It is therefore ambitious, with the goal of ob- 
taining spatially-resolved kinematics of about one thousand of 
galaxies spanning a large range of cosmic time and mass. Cur- 
rent 3D surveys on 8- 10m telescopes already allowed us to sam- 
ple a significant part of the cosmic look-back time, from z=0.4 to 
z~3, i.e., up to ~11 Gyr ago. Main 3D samples are concentrated 
around z~0.6 (see the IMAGES sample, Yang et al. 2008) a nd z-2 
(see the S INS and OSIRIS samples, Forster Schreiber et aT 2009^; 
iLaw et ai]l2009), and it is unlikely that current or event future 3D 
spectrographs on 8- 10m telescopes will provide us with large and 
representative samples beyond z~3, due to limitations in surface 
brightness detection, even when fed with adaptive optics. There- 
fore, we focused on the redshift range 2-5.6, which remains largely 
unexplored, with the advantage of sharing the z~2 limit with exist- 
ing data. The upper limit is driven by the availability of emission 
lines in the NIR window, and corresponds to the [Oil] emission line 
getting out the K band. 

It is worth emphasising that such a 3D survey is by nature 
radically diff'erent from what is currently done at z>l. Because of 
the limited sensitivity of current 3D spectrographs, galaxies are tar- 
geted in optimal windows, where the atmospheric absorption is low. 
This, combined to the collection of selection criteria used at high-z 
to pre- select targets with known redshifts (e.g., BzK, or Lyman- 
break techniques) result in samples whose selection functions and 
representativity have been the subject of a long debate. The only 
way to overcome these limitations is to conduct a deep optical-NIR 
imaging and redshift survey with high completeness up to z~5.6, 
from which a 3D follow-up on the E-ELT will guarantee, without 
any preselection, the representativity of the resulting sample. 

Several integral field spectrographs (IFS) are currently under 
study for the E-ELT. The present program is designed for a multi- 
object IFS, since multiplex is required to reach a significant number 
of targets. For example, EAGLE, the project of multi-integral field 
spectrograph f or the E-ELT, would be a w ell- suited instrument for 
such a survey dCubv etal.|[2008l :l Puech et al. 2009b). However, we 
want to emphasise that the present paper does not intend to explore 
the scientific capabilities of this particular instrument, beyond the 
fact that it is a NIR multi-IFU spectrograph. 

2.2 Simulation pipeline 

The end-products of 3D spectrographs are usually data-cubes in 
FITS format. Hence, we have developed a simulation pipeline that 
produces such mock data. These simulated data are produced as- 
suming a perfect data reduction process, whose impact has there- 
fore been neglected (see Sect. 5.1). 

Data-cubes are simulated using a backward approach of the 
usual analysis of 3D data. During such an analysis, each spaxel 
is analysed separately in order to extract from each spectrum the 
continuum and the first moments of emission lines. One ends up 
with a set of maps describing the spatial distribution of the contin- 
uum and line emissions (zeroth order moment of the emission line), 
as well as the gas velocity field (first order moment) and velocity 
dispersion map (second order moment). Such maps are routinely 
derived from Fabr y-Perot observations of local galaxies (see, e.g, 
lEpinatetal.ll2008h . or can be generated as by-products of h ydro- 
dynamical simulations of local galaxies (e.g.. Cox e t al.ll2006l) . Us- 
ing simple rules and empirical relations, these maps can be rescaled 
(in terms of size and total flux) to provide a realistic description of 
distant galaxies (see Sect. 3.1). Assuming a Gaussian shape for all 
spectra, it is then straightforward to reconstruct a data-cube from 
such a set of four maps, by "reversing" the usual analysis of 3D 
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data. This allows us to produce a data-cube at high spatial reso- 
lution, which can then be degraded at the spatial resolution (AO 
PSF) and sampling (IFU pixel scale) required for simulating data 
produced by a given 3D spectrograph. During the process, realistic 
sky and thermal backgrounds, as well as phot on and detector noise 
are added. The pipeline is fully described in IPuech et al ] (l2008bh 
and lPuech et aP ([2008c). 

The simulation pipeline generates ndit data-cubes with indi- 
vidual exposure time of dit, which are combined by estimating the 
median of each pixel to simulate several individual realistic expo- 
sures. Since we have only included random noise, it is similar to 
having dithered all of individual exposures and combining them af- 
ter aligning them spatially and spectrally. Sky frames are evaluated 
separately (i.e., with a different noise realization), and then sub- 
tracted from each individual science frame, reproducing the usual 
procedure in both optical and NIR spectroscopy. 



2.3 Overview of simulation outputs 

We list here the FITS files generated by the simulation pipeline that 
are relevant for the present study: 

• An IFU data-cube: this is the final product of the simulation, 
which corresponds to mock observations; 

• A total background spectrum (sky continuum, OH sky lines, 
and total thermal background); 

• A thermal background spectrum; 

• An SNR data-cube, which gives the expected SNR within each 
pixel of the IFU data-cube. This spectroscopic SNR is derived as 
follows: 



SNR(i,Jy,k,): 



^JO{ix, jy, kx) + S (ijc, jy, kx) + ron^ + dark 



where 0{ix,jy,kx) and S(ix,jy,A) are respectively the object and 
sky flux per dit (after accounting for atmospheric transmission) 
in the spatial position (i^, jy) of the data-cube (in pixels), and at 
the spectral position along the wavelength axis (in pixels). In 
the following, the "maximal SNR in the emission line in the pixel 
(ix.jyY' refers to MAXk^[S NR(ix, jy,kA)], and the "spatial-mean 
SNR" refers to the average of this quantity over the intrinsic galaxy 
diameter, i.e., the diameter of the galaxy irrespective of what parts 
of the galaxy are detected. 

For each simulation, the main parameters are recorded in the head- 
ers of the corresponding FITS files 

The simulated IFU data-cubes are then analysed using an au- 
tomatic data analysis pipeline similar to those generally used to 
analyse data of high redshift galaxies. During this process, each 
spatial pixel of the simulated data-cube is fitted with a Gaussian 
in wavelength, whose position and width correspond respectively 
to the velocity and velocity dispersion of the gas in this spatial 
pixel. Only pixels with a kinematic signal to noise ratio SNRkin of 
at least three (see definition below) were considered to limit uncer- 
tainties. A more detailed description of the process can be found in 
IPuech et al. ( 2008b) and Puech et aL (.2008c) . The analysis pipeline 
produces the following FITS files: 

• An emission line flux map; 

• A velocity field; 



^ Examples of simulated data-cubes can be downloaded at 
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• A velocity dispersion map; 

• A map of the kinematic SNRkin, which is defined as the total 
flux in the emission line divided by the noise on the continuum 
o'continuum and ^ pix, thc numbcr of pixels within the emission line; 

• A map of the maximal SNR, i.e., the SNR at the peak of the 
emission line within each spaxel of the simulated IFU data-cube. A 
spatial-mean value is written down into the header (see above). 



2.4 Methodology for simulations 

All input parameters can be separated into two broad categories, 
namely the "physical" parameter space, which includes all param- 
eters defining the distant galaxy (i.e., redshift z, galaxy diameter, 
continuum AB magnitude Mab, rest-frame emission line equivalent 
width EWq, velocity gradient, and morpho-kinematic type), and the 
"observational" parameter space, which includes the telescope (pri- 
mary mirror Ml and secondary mirror M2 diameters, temperature 
Tteu and emissivity Stei), the instrument (spectral resolution R, IFU 
pixel size I^pix, detector integration time dit, number of exposures 
ndit, temperatures Tinstr and emissivities Einstr, AO correction, and 
global transmission itransm), and the site (seeing, profile, outer 
scale of the turbulence Lq, sky brightness and atmospheric trans- 
mission). 

Given the very large number of parameters to be investigated, 
as well as the very large range of values to be explored, it is use- 
ful to define a "reference case", around which the parameter space 
can be explored and compared with. As such a reference case, we 
adopted an M* galaxy at z-A observed using MOAO (Multi-Object 
Adaptive Optics, see Sect. 3.2) with a median seeing of 0.8 arc- 
sec. At this redshift, the [Oil] emission line is observed in the H- 
band, where the influence of the thermal background is minimised 
in comparison with the K-band. This makes this reference case as 
independent as possible of the telescope design (e.g., number of 
mirrors), environmental conditions (site selection), and instrument 
characteristics (e.g., number of warm mirrors), which are not all 
fully known at present. 

This reference case will be used to assess separately the influ- 
ence of: 

• the AO correction for a given set of other observational and 
physical parameters; 

• the observational parameters for a given set of physical pa- 
rameters and AO correction; 

• the physical parameters for a given set of observational pa- 
rameters (AO correction included). 



2.5 Metrics and figures of merit 

To decide whether or not the scientific goals have been met, we 
adopted a pragmatic point of view and defined as a general met- 
ric the total observation time of the survey Tintg required to achieve 
the proposal goals, i.e., observe Ngai galaxies more massive than 
M stellar - 10^^ M© in the redshift range 2-5.6. A high number of 
galaxies {Ngai >100) is required to allows us to derive statistics over 
the morpho-kinematic types of galaxies in several redshift and stel- 
lar mass bins (see Sect. 4.6). We require Tintg < 100 nights, which 
roughly corresponds to the total time allocated at ESO to Large 
Programs per VLT per year (i.e., 30% of the available time). There- 
fore, such a survey could reasonably be implemented as a several 
years eff'ort. Finally, this total time should correspond to SNR levels 
that guarantee to reach at least step (ii) for most of galaxies in the 
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survey, and step (iv) (see Sect. 2.1) for a more limited sub-sample 
(to be defined by simulations). 

From a more practical point of view, such a survey will be 
faced with the difficulty that future targets will have to be drawn 
from a larger parent sample because of selection constraints on, 
e.g., avoidance of OH sky lines, which reduces the a vailability of 
redshif t windows at a 30-40% level in H-band (see IPuech et al] 
l2008bh. Other technical constrains might be related to, e.g., the de- 
sign of instrument setups in terms of spectral bandwidth. However, 
the current development of EAGLE, the project of multi-integral 
field spectrograph for the E-ELT, considers the capability of obtain- 
ing the equivalent of large-band filters in terms of spect ral band- 
width in a sinsle observational shoot dCubv etal.|[2008h . Finally, 
the construction of this parent sample will require substantial ob- 
serving time from current or future facilities (e.g., VLT/HAWK-I, 
JWST/NIRCAM, VISTA). Given that the target density at z-4 is 
of the order of one per arcmin^ (down to /a5=25, see Steid el et aP 
[l999l . I2003L defining a parent sample of, say, one thousand of 
galaxies will require deep imaging in the NIR over several square- 
degrees. This will be further discussed in Sect. 5. 
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Figure 2. Schematic illustration of the method used to re-scale the morpho- 
kinematic templates in order to make them match the size and flux of distant 
galaxies. Also, the velocity dispersion (not shown here) can be rescaled 
using a multiplicative value. 



2.6 Test case 

We conducted a special first run to compare the results of the simu- 
lation pipeline with real 3D observation s on the VLT of a galaxy at 
z~2.4 observed bv lGenzel et all (120061) using SINFONI. The goal 
of this run is to use input parameters corresponding to a real ob- 
served case and assess whether or not the simulation pipeline is 
able to produce a data-cube with the same quality, quantified us- 
ing the median SNRkin (see definition in Sect. 2.3). For the z~2.4 
galaxy observed with SINFONI, the corresponding input parame- 
ters are: galaxy diameter of 0.8 arcsec, K=21.47, EWo(HQf)=140A, 
integration time T/„^^=6 hr, pixel size of 50x100 mas^, PSF with 
FWHM=150 milli-arcsec (mas), temperature of the telescope and 
instrument of 287K (VLT), emissivity of the telescope of 6% 
(Cassegrain focus), emissivity of SINFONI of 15%. We took care 
of mimicking the SINFONI data reduction procedure by interpolat- 
ing pixels from the physical 50x100 mas^ spatial scale down to the 
50x50 mas^ fin al scale, and smoo thing the data-cube to a resolu- 
tion of 190 mas (iGenzel et al.|[2006 ) . The results of this validation 
run are shown in Fig. [T] and demonstrate that, given a complete 
set of observational and physical parameters, simulations can pro- 
duce data-cubes quantitatively similar to real observations. More- 
over, the similarity between the observed velocity field and velocity 
dispersion map (see first panel in Fig.[T]) and those produced by the 
automatic analysis pipeline (see second panel) shows that the au- 
tomatic analysis pipeline can be used safely and does not add too 
much uncertainty on the derived kinematics compared to a more 
careful visual examination and fitting of the data-cube. 



3 SIMULATIONS 
3.1 Scientific inputs 

Scientific inputs are needed in order to (1) provide the simulation 
pipeline with morpho-kinematic templates and (2) re- scale these 
templates as a function of realistic distant galaxy sizes, fluxes, and 
velocity gradients. A global flowchart of the rescaling procedure is 
shown in Fig. [2 while specific details are given below. 

Morpho-kinematic templates: Table [T] summarises the main 
properties of the high resolution templates used for the simulations. 



which are shown in Fig. [3] and |4] These templates were obtained 
from both re al observatio ns (Fabry-Perot interferometry of loca l 
galaxies, see lEpinat et al.l [2008 and Fuentes-Carrera et all 2004 ). 
and hydro-dynamical sirn ulations of galaxies fsee lCox et al.ll200^ 
an d iBournaud etaDl2008h . 

Redshift: Given the objective of the present study (see intro- 
duction and Sect. 2. 1), and as a compromise between the number of 
redshift and mass bins at constant total number of targets, three red- 
shifts were considered for simulations. Table |2] summarises these 
redshifts with the corresponding targeted emission line: we chose 
z=2 (look-back time of 10 Gyr), z=4 (look-back time of 12 Gyr), 
and z=5.6 (look-back time of 12.6 Gyr), which samples well look- 
back times above z=2. As stated in Sect. 2.3, this choice was not 
driven by any consideration of optimal atmospheric transmission, 
but purely from scientific grounds. The impact of this selection will 
be discussed in Sect. 5.1.2. 

For simplicity, the [Oil] emission line was assumed to be a 
single line centred at 3727A; this allows us to avoid introducing 
an additional parameter, i.e., the line ratio between the lines of 
the doublet, which depends most ly on the electron density in the 
medium fe.g. JPuech et al ] |2006ah . This does not influence the in- 
tegrated SNR over the emission line but leads to overestimate the 
maximal spectroscopic SNR in the emission line. The impact of 
this assumption will be investigated in Sect. 5.1.1. Note that the 
highest redshift considered (z=5.6) corresponds to the limit above 
which [Oil] gets redshifted out of the K-band. 

Stellar mass and flux: We used the MUSIC c ompilation of 
publi c spectro-photometric data in the GOODS field jGrazian et al.l 
l2006h to derive empirical relations between redshift, observed K- 
band magnitude, and stellar-mass Msteiiar (see Fig. [5]). The latter 
quantity was expressed as a fraction of the characteristic stellar- 
mass M*(z) at a given redshift, which describes the knee of the 
Galaxy Stellar Mass Function (GSMF) at this redshift, according to 
a Schechter function. In other words, at a given z, galaxies with stel- 
lar mass Msteiiar - M^(z) are those which contribute the most to the 
stellar mass density at this redshift. Table |2] gives the correspond- 
ing stellar masses in the simulations as a function of redshift. The 
pseudo-continuum flux around the emission line is derived directly 
from the K-band magnitude. To avoid any additional parameter, we 
did not apply any "colour" correction between the rest-frame wave- 
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z=2.3834 Rgal-0.8" 
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Median SNR=4.0 

Figure 1. Comparison of SINFONI data with simulations of a rotating disk (UGC5253 template, see Sect. 3.1) using the same set of observational and physical 
parameters (see text). First panel: SINFONI observations of a z~2.4 galaxy (left: velocity field; right: velocity dispersion map), reproduced from Genzel et 
al. (2006). Second panel: analysis of the reduced SINFONI data-cube (courtesy of N. Forster-Schreiber) using our own automatic analysis pipeline (see Sect. 
2.4). From left to right: velocity field, velocity dispersion map, Ha map, and SNRkin map. Third panel: simulations of a rotating disk using the observational 
and physical parameters corresponding to the real SINFONI observations. The median SNRkin is found to be very similar in both cases. 



Name 


Morphological type 


inc (deg) 


Mb or M stellar 


z 


Comments 


Reference 


UGC5253 


Sab 


40 


-20.8 


0.00441 


Rotating disk 


Garrido et al. (2002) 


UGC6778 


SABc 


30 


-20.6 


0.003226 


Rotating disk 


Garrido et al. (2002) 


UGC7278 


Im 


44 


-17.1 


0.00097 


No rotation 


Garrido et al. (2004) 


UGC7592 


IBm 


64 


-17.8 


0.00069 


No rotation 


Garrido et al. (2004) 


ARP271 


SAc-SBc 


59/32 


-20.6/-21.2 


0.0087 


Merging pair 


Fuentes-Carrera et al (2004) 


Major merger 


Sbc-Sbc 


69 




0.0 


Simulation 


Cox et al. (200^ 


Clumpy Disks 




50 




1.0 


Simulations 


Bournaud et al. (2008) 



Table 1. Main properties of the morpho-kinematic templates used for the simulations. The first five templates are real observations, while the two last are hydro- 
dynamical simulations. From left to right: Name, morphological type, morphological inclination, absolute B-band magnitude or stellar mass for simulations, 
redshift, and comments. 



length of the emission line (e.g., 0.3727 jjim for [Oil]) and that of 
the K-band (e.g., 0.44 jim at z-A). Such an assumption is consis- 
tent with Spectral Energy Distributions (SED) of galaxies with a 
mor phological type latte r than Sa within a factor two in flux (see, 
e.g., iKinnevetalJI 19961) . As a reference case, we adopted a z=4, 
galaxy (see Sect. 2.4). 

Rest-frame emission line equivalent width: We assumed 
EWo([OII1)=3 0A, which is an ex trapolation of the mean value 
found at z=l (lHammeretal.1 [19971) . This parameter does not in- 



fluence the emission line flux distribution of the galaxy but is used 
to set its total integrated value (see Puech et al. 2008b). 

Size: We used empirical relations between redshift and half- 
light radius from the literature. To mitigate the impact of the dif- 
ferent sample selection criteria used at high redshift, we average 
the difl'erent values found in the literatur e (Bouwens et al. 2004; 
iFerguson et al.! 1200^ : IPahlen et al.! l2007b . The resulting "mean" 
ha lf-light radius was t hen k-corrected using the empirical relation 
of iBarden et al.l (l2005h . and re- scaled to the assumed stellar-mass 
using the local scaling between the K-band luminosity (used as a 
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Figure 3. Morpho-kinematic templates used for the simulations. From left to right: continuum map, emission line map, gas velocity field, and gas velocity 
dispersion map. For the major merger simulation, the first two maps are the stellar surface density and the gas surface density maps (see Tab.[T]for details). 
Spatial scales are in arbitrary units since the templates are rescaled in terms of distant galaxy sizes (see text). 



proxy for stellar mass) and size reported bv lCourteau et alJ (l2007h . 
i.e., Rhaif oc L^j^^ . The total size (diameter) was assumed to be four 
times the half-light radius. Note that in the case of a exponential 
thin disk of scale-length R^, R25 = 3.2i?j and Rhaif = 1.68/?j, which 
approximately leads to R25 ~ 'IRhaif- The diameters (in arcsec) used 
in the simulations are given in Tab. [21 

Velocity gradient amplitude: The velocity amplitude of the 
velocity field was r escaled using the local stellar-mass Tully-Fisher 
relation, following iHammer et al ] (l2007h (see Tab. [2]). Note that 
strictly speaking, this procedure is quantitatively correct only if ap- 
plied to rotating disk morpho-kinematic templates. 



3.2 Observational inputs 



Site and sky background: A Paranal-like site is assumed with 
T^,Ye=280K. Atmospheric absorption is modelled following a 
Paranal-like site, although at a lower spectral resolution than the 
official current model (see bottom panel on Fig. [6]). Sky emission 
(continuum and OH lines) was accounted for using a model from 
Mauna Kea, which has the advantage of also including zodiacal 
emission, thermal emission from the atmosphere, and an average 
amount of moonlight. This Mauna Kea model is two times fainter 
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Figure 4. Morpho-kinematic templates from simulations of clumpy disks viewed from different azimuthal angles (from top to bottom: 0, 45, 90, and 135 deg). 
From left to right: stellar surface density map, gas surface density map, gas velocity field, and gas velocity dispersion map (see Tab. [T] for details). Spatial 
scales are in arbitrary units since the templates are rescaled in terms of distant galaxy sizes (see text). 



z 


Look-back time (Gyr) 


Emission line 


Observed broad-band 


Quantity 


O.IM* 


0.5M* 


l.OM* 


5.0M* 


lO.OM* 


2.0 


10 


Ha /16563A 


K 


logiM stellar /Mq) 


10.2 


10.9 


11.2 


11.9 


12.2 










Kab 


24.7 


23.0 


22.2 


20.5 


19.7 










AV (km/s) 


160 


210 


260 


350 


430 










Size (arcsec) 


0.68 


1.19 


1.52 


2.67 


3.40 


4.0 


12.0 


[OII]i3727A 


H 


log(M stellar /Mq) 


9.7 


10.4 


10.7 


11.4 


11.7 










Kab 


26.8 


25.1 


24.3 


22.6 


21.8 










AV (km/s) 


130 


180 


200 


300 


330 










Size (arcsec) 


0.33 


0.59 


0.75 


1.3 


1.7 


5.6 


12.6 


[OII]i3727A 


K 


logiMstellarlMo) 


8.9 


9.6 


9.9 


10.6 


10.9 










Kab 


27.8 


26.0 


25.3 


23.5 


22.8 










AV (km/s) 


90 


110 


140 


200 


240 










Size (arcsec) 


0.28 


0.50 


0.63 


1.11 


1.41 



Table 2. Redshifts (and look-back times) considered for simulations, with the corresponding emission line targeted, and the broad-band filter within which the 
redshifted emission line falls. On the right side: Stellar mass, K-band magnitude, velocity gradient, and size (defined as four times the half-light radius) of the 
simulated galaxies as a function of redshift. 
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z 



Figure 5. Empirical relations between the observed K-band magnitude and 
redshift used to re-scale the morpho-kinematic templates to make them 
match the continuum flux of distant galaxies at the corresponding stellar- 
mass. Black points were derived from fitting the luminosity function and 
mass function of galaxies in the MUSIC survey (see text). The variation of 
M* with redshift, i.e., the knee of the GSMF, is given by the formula on 
the bottom. The upper formula gives the empirical evolution of the K-band 
magnitude of an M* galaxy as a function of redshift. At a given z, one finds 
that^A5 = 20.18 -F6.831ogz- 2.5 logMJMo. 




X[/xm] 

Figure 6. Atmospheric model used for the simulations. Upper panel: sky 
emission spectrum from the Gemini ETC (see text). The dashed-red line 
is the official DRM model, which is two times brighter in H band. Bottom 
panel: Atmospheric model from the VLT ETC. 

in H-band than a Paranal-like site (see upper panel on Fig. (6]). The 
influence of the sky background will be discussed in Sect. 5.20 

Telescope model: We followed the official 42 meter E-ELT 
5 -mirror design. Its thermal background was modelled using a gray 

^ Seelhtt p y/www.eso.org/sci/facilitie s/eelt/science/drm/tech-data/i for a de- 
tailed comparison of the two models. 



body, and assuming an emissivity of 5% {Ag+A\ coating). We ne- 
glected the central aperture, as it is not included in the modelling 
of the PSFs (see below). The collecting area of the telescope is 
therefore 8% larger that the official baseline. This will be further 
discussed in Sect. 5.1.1. 

Instrument model: We assumed a reference pixel size of 50 
mas, and a reference spectral power of resolution of R=5000, as a 
compromise between our desire to minimise the impact of the OH 
sky lines and not wanting to over-resolve the line by a large factor. 
The instrument thermal background was modelled using two gray 
bodies, following a preliminary study of the thermal background 
of EAGLE (Laporte et al. 2008). The first one models the eflfect of 
the Target Acquisition System, and assumes a temperature of 240K 
and an emissivity of 15%. The second one models the effect of the 
spectrograph, and assumes a temperature of 150K and an emissiv- 
ity of 69%. The instrument background represents less than 10% of 
the telescope background in both the H and K bands (see Fig.|7]), in 
agreement with the official requirements for EAGLE, which spec- 
ify that "the instrument thermal background at the detector shall be 
less than 50% of that from the telescope" (goal=10%). As we will 
see below, the total background is dominated by the thermal con- 
tribution of the telescope in the K band, and by the sky background 
in the H band. Therefore, we did not explore variations in the ther- 
mal contributions from the instrument, which is never the dominant 
source of background. 

Detector model and exposure time: We rely on the de- 
scription of a cooled R ockwell HAWAII-2R G IR array working at 
~80K, as described bv IPinger et alj (l2006h . with a read-out-noise 
(RON) of 2.3 e/pix and a dark current of 0.01 e/s/pix. Its thermal 
background, bias, and saturation threshold were neglected. Because 
observations of distant galaxies in the NIR are generally not lim- 
ited by the detector noise, we did not explore parameters that have 
a non-negligible influence only in such a regime, i.e., variations of 
the individual frame exposure time dit, and detector characteris- 
tics (dark and RON). In practice, we chose J/^=3600s to speed up 
the simulations while keeping a reasonable number of individual 
exposures, although realistic exposures will be much shorter be- 
cause of sky variations and saturation of the detector. We assumed 
a reference case exposure time of Ti^tg = dit x ndit-2A hours, i.e., 
ndit-2A. 

Global throughput: We assumed a global throughput 
^transm-20%, dctcctor QE includcd. The official DRM baseline as- 
sumes a transmission of 90-95% for the 5 mirror design telescope 
and a QE of 90%. The official requirements for EAGLE, specifies 
a throughput of at least 35%, including the detector QE. Therefore, 
according to the baseline, the global throughput should be 31%, 
which is larger that our assumption. This will be further discussed 
in Sect. 5.1.1. The integrated number of photons reaching the detec- 
tor is a degenerated function of some parameters which have no im- 
pact on the spatial or spectral resolution, i.e., Tintg, ttransm^ EWq, and 
D. Therefore, we chose not to explore variations in global trans- 
mission, which can be directly derived by analogy with variations 
of associated degenerated parameters. 

PSF model: The coupling between the AO system and the 3D 
spectrograph is captured through the AO system PSF. Therefore, it 
is a crucial element that needs to be carefully simulated, and can- 
not be approximated by, e.g., a simple Gaussian. Given th e multi- 
plex r equirements for the present science case, only GLAO (|Rigaut| 
l2002h and MOAO jAssemat et aLll2007h have been considered. All 
PSFs include the eflfect of telescope aperture, but neglect the eflfects 
of the central obscuration and spiders. They were generated at the 
central wavelength of the corresponding filter (e.g., 1.65 microns 
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Figure 7. Contributions to the total background in the simulations, at z=4 (upper panel) and z=5.6 (bottom panel). Left column: comparisons between the sky 
contribution (continuum and OH sky lines, in black) and the total thermal background (telescope and instrument, in red). Right column: comparison between 
the telescope (black) and instrument (red) contributions to the total thermal background. 



for the H band), and the difference between this wavelength and 
the actual wavelength where the emission line is observed (e.g., 
1.86 jim for [Oil] redshifted at z-A) was neglected. 

GLAO PSFs were taken from the official DRM ftp deposi- 
These PSFs were generated by M. Le Louarn at ESO us- 
ing an end-to-end code and the official DRM baseline assumptions 
(vl). Of note, these PSFs correspond to a total integration time of 
4 seconds and therefore include non-negligible speckle noise. 

MOAO PSFs were simulated using an analytical code (cal- 
ibrated against an end-to-end code) by B. Neichel (GEPI-Obs. de 
Paris/ONERA) and T. Fusco (ONERA), which has the advantage of 
producing PSFs free of speckle noise, i .e., more representative of 
long-exposure PSFs (iNeichel et al.l2008h . Briefly, we used assump- 
tions as close as possible to the assumptions used for GLAO PSF 
modelling: the pitch (inter- actuator distance) was assumed to be 
0.5m (~84x84 actuators in the pupil plane), with a reference seeing 
of 0.8 arcsec and an outer-scale of the turbulence Lo=25m; the same 



^ http ://www. eso . org/sci/facilities/eelt/science/drm/tech_ data/ao/ 



10 turbulent layer profile that the one used for GLAO PSFs was 
considered. The wavefront was measured using three guide stars 
(assumed to be natural guide stars, i.e., specific issues related to 
laser guide stars like, e.g., the cone effect, were neglected), located 
on a equilateral triangle asterism. 

In order to sample different performances for the MOAO and 
GLAO systems, different asterism sizes were considered. The PSFs 
were systematically estimated on-axis. A detailed analysis of the 
coupling between the MOAO and 3D spectroscopy can be found 
in Puech et al. (2008b). Of interest here is that in most cases, the 
coupling between the MOAO system and the IFU pixel scale is 
such that the spatial resolution is set by the latter because the PSF 
FWHM is smaller than twice the IFU pixel scale. Improving the 
MOAO correction further does not provide any gain in spatial res- 
olution but still improves the Ensquared Energy (EE) in a spatial 
element of resolution (i.e., the fraction of light entering a spatial el- 
ement of resolution), hence the SNR. This justifies the usual choice 
of c haracterising the EE in an aperture equal to twice the pixel size 
(see IPuech et al.ir2008bl for details). The characteristics of the dif- 
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GLAOFoV(') 


EE in 100 mas 


MOAOFoV(') 


EE in 100 mas 


1 


15.0 





63.6 


2 


12.5 


0.3 


56.1 


5 


8.2 


0.5 


45.6 


10 


6.0 


1.0 


33.7 


15 


5.3 


2.0 


27.2 






3.0 


24.1 






4.0 


23.1 






5.0 


22.7 



Table 3. Sets of GLAO and MOAO PSFs generated for the simulations. 
The asterism size (dubbed "FoV", i.e., the position of guide stars) is varied 
in order to produce a range of EE sampling typical corrections. All values 
were measured in the H-band with a seeing of 0.8 arcsec. 



ferent PSFs used in this study are liste d in Tab.O A detaile d study 
of MOAO PSF shapes can be found in lNeichel et aP (l2008h . 

We chose reference PSFs for both AO systems as close as 
possible of the middle of their range of EE performances: the H- 
band MOAO reference PSF has EE=45.6% (in 100 mas), while the 
H-band GLAO reference PSF has EE=8.2% (in 100 mas). Unless 
stated otherwise, these two PSFs are those used in the simulations. 



3.3 Simulation runs 

Unless stated otherwise, parameters are by default set to their ref- 
erence values as described above. 

Influence of AO correction: We explore the influence of the 
AO correction by increasing the EE in a given aperture as described 
in Sect. 3.2 (see Tab. [3]), for two types of AO systems, i.e., GLAO 
and MOAO. Simulations were done for all the morpho-kinematic 
templates in the MOAO case, and for the two UGC5253 and Major 
merger templates in the GLAO case. 

Influence of observational and physical parameters: Simu- 
lations were performed for five distinct stellar masses, as described 
in Tab. [3 and in two diff'erent runs, as described in Tab.lU 

The simulations listed in Tabs.[2l[3l and|4]correspond to a total 
of about 760 individual runs. 



4 RESULTS OF SIMULATIONS 
4.1 Influence of AO correction 

The spatial-mean SNR obtained for the reference case (H band) as 
a function of Ensquared Energy in a 100 mas aperture (i.e., twice 
the reference pixel size, see above) is plotted in Fig. [S] For com- 
parison, we also plotted results using the major merger template. 
This figure shows that, all simulations approximately fall along the 
same curve, independently of the AO type, although diff'erent codes 
(end-to-end vs. analytical) were used (see Sect. 3.2). This means 
that, considering a given morpho-kinematic template, all the dif- 
ferent PSFs can be safely compared. For comparison to the GLAO 
and MAO PSFs described in Sect. 3.2, we have added MCAO PSFs 
(with 3DMs and asterisms of 0.5 and 5 arcmin and corrections de- 
rived at 0, 0.5,2, and 2.5 arcmin away from the centre of the FoV) 
and LTAO PSFs (FoV of 45 arcsec, on axis). Details about how 
these PSFs were generated can be found in Neichel et al. (2008b). 
The PSFs picked up as representative performances of the GLAO 
and MOAO modes (see Sect. 3.2) have been indicated by red ar- 
rows. In the H band, they can be considered as well representative 
of the AO performance within a range of ~ +2 in SNR for MOAO, 
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Figure 8. Spatial mean SNR obtained for the reference case (H band) as 
a function of Ensquared Energy in a lOOmas aperture for different type of 
AO systems. The black points correspond to a rotating disk template, while 
red points correspond to a major merger template. The left most symbols 
represent the seeing-limited case. The GLAO and MOAO Reference Cases 
(see Sect. 3.2) are indicated by red arrows. 



and ~ +3 for GLAO. A systematic study of the impact of EE on 
the kinemat i cs of d istant galaxies using MOAO can be found in 
IPuech et all (l2008bh . Of interest here is that the reference MOAO 
PSF, with an EE of 45.6% in an aperture of 100 mas is above the 
minimal requirements derived in this study. 



4.2 DRM Steps 

In the rest of this section, we investigate the impact of the instru- 
ment, site, and telescope, on the scientific capabilities of the E-ELT 
equipped with a NIR-IFS in each of the DRM goals/steps as defined 
in Sect. 2.1. 



4. 2. 1 DRM STEP 1 : 3D detection 

We adopt a lower limit in SNR of S NRmin - 5 (spatial-mean SNR 
in the emission line, see Sect. 2.3) for the 3D detection of distant 
galaxies. The SNR obtained as a function of the observational pa- 
rameters and mass was studied in details in IPuech etaLl(l2008ch .In 
this study, we found that the minimal SNR scales as follows: 



24h) U2mA30AjU000 



SOmas J 



This scaling corresponds to what is expected in a background- 
limited regime, and applies to all morpho-kinematic templates. As 
discussed in Sect. 3.2, Ti^tg and the total throughput (not explicitly 
explored by simulations) follow the same scaling. 

Figure |9] gives the stellar mass that can be reached as a func- 
tion of redshift for MOAO corrections, a seeing of 0.8 arcsec, and 
S NRmin = 5 (black line). In this plot, error-bars represent the range 
of stellar mass derived considering all the morpho-kinematic tem- 
plates. We adopted the middle of this range as a simple estimator of 



12 M.Puechetal 



RUN #1 RUN #2 



z 


(mas) 


Morpho-kin. 
Template 


EWo 
(A) 


R 


D 

(m) 


z 


Tintg 

(hr) 


(mas) 


AO 
correction 


Morpho-kin. 
Template 


Seeing 
(arcsec) 


4 


25 


UGC5253-Major merger 


30 


5000 


42 


2 


8 


50 


MOAO 


ALL 


0.8 




50 


UGC5253-Major merger 


30 


5000 


42 




24 


50 


GLAO 


ALL 


0.8 




50 


UGC5253-Major merger 


15 


5000 


42 








MOAO 


ALL 


0.8-0.95 




50 


UGC5253-Major merger 


30 


10000 


42 


4 


8 


50 


MOAO 


ALL 


0.8 




50 


UGC5253-Major merger 


30 


2500 


42 




24 


50 


GLAO 


ALL 


0.8 




75 


UGC5253-Major merger 


30 


5000 


42 








MOAO 


ALL 


0.8-0.95 




50 


UGC5253-Major merger 


30 


5000 


30 


5.6 


8 


50 


MOAO 


ALL 


0.8 



24 50 GLAO ALL 0.8 

MOAO ALL 0.8-0.95 



Table 4. Observational and physical parameters investigated in the first and second run of simulations. All other parameters are set to their reference values as 
described in Sect. 3. 




Figure 9. Left: Stellar mass limit that can be reach at a given SNRmin as a function of redshift, AO correction, and seeing. Error-bars represent the range of 
mass limit obtained when considering all the morpho-kinematic templates. The vertical dashed line shows the limit above which the [Oil] emission line gets 
redshifted into the K band. Middle: Zoom of the redshift range [2,4]. Long-dashed lines show iso-magnitude curves. For clarity, only a subset of all the curves 
shown in the main panel are reproduced here, and the range of mass limit is shown only for the MOAO cases with 0.8 arcsec seeing. Right: Comparison 
between the median emission line surface brightness of the simulated galaxies (in red; the error-bars represent the values found in the 0.1-lOM* range) with 
that from the thermal background (telescope and instrument, blue lines) and total background (thermal plus continuum and OH sky lines, black lines). For 
saving computation time, only narrow spectral ranges around the targeted emission lines were simulated for the three redshift cases. 



the stellar mass one can reach at a given redshift under the different 
specific observational conditions considered in this plot (see Fig. 
[9]). One can see that the limit in stellar mass grows exponentially 
with redshift: 



Ml 



— ^ 3.3 X 10"^ exp 



z 

037 



which is due to the exponentially increasing contribution of the 
thermal background from the telescope in the NIR (see right panel 
in Fig. [9]). Noteworthy, the mass limit evolves quite smoothly with 
redshift up to z~4-4.5: this means that this limit does not depend 
very strongly on input parameters like, e.g., seeing (see the blue 
curve for a 0.95 arcsec), or AO correction (see the red curve for 
GLAO corrections). It is not surprising that jVlOAO and GLAO 
give similar results, as the SNR considered here is a spatial-mean 
over the galaxy diameter at constant spatial and spectral sampling, 
which does not take into account dififerences in terms of spatial res- 
olution (see next Section). One can adopt SNRmin = 10 (instead of 
5) without impacting too strongly the resulting mass limit. 

It is useful to summarise Fig.|9]by deriving the redshift up to 
which the Galaxy Stellar Mass Function (GSIVIF) can be probed 



down to M*, as galaxies having such a stellar mass at a given z are 
those contributing the most to the stellar mass density at this red- 
shift. From Fig. 121 one can see that the GSMF can be probed down 
to up to z^4, independently of the SNR limit, AO correction, and 
seeing conditions. 



4.2.2 DRM STEP 2: Large scale motions 

As detailed in IPuech et al ] (l2008bh . large scale motions carry im- 
portant information about the dynamical nature of galaxies. In this 
section, we focus on the two extreme cases of morpho-kinematic 
variations found amongst the galactic zoo, which are regular rotat- 
ing disks as opposed to major mergers. Indeed, because the latter 
are the most violent galactic events, they are expected to result in 
the largest morpho-kinematic amplitudes. In this section, we con- 
sider only the UGC5253 and the major merger templates to illus- 
trate how large-scale motions can be used to distinguish between a 
rotating disk and a major merger using spatially-resolved kinemat- 
ics on the E-ELT. 

It is difficult to build a simple criterion that would allow us to 
assess the quality of the recovered large-scale motions, and quan- 
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17.89-16.46 


10.0 


32.24-33.27 


24.21-23.46 


^stellar 


SNR w/ MOAO 


SNR w/ GLAO 


(inM,(z)) 


z=5.6 


z=5.6 


0.1 


0.78-0.44 


0.37-0.18 


0.5 


1.49-1.23 


0.90-0.60 


1.0 


2.07-1.60 


1.28-0.86 


5.0 


3.92-3.75 


2.76-2.20 


10.0 


5.14-5.13 


3.66-3.15 



Table 5. Spatial-mean SNR obtained for simulations of the reference case 
as a function of mass and redshift, for MOAO (upper part) and GLAO cor- 
rections (bottom part). The first value corresponds to simulations with a 
rotating disk template (UGC5253), while the second value corresponds to 
simulations with a major merger template. 
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Figure 11. Results of major merger simulations at z=5.6 using MOAO. 
From top to bottom: velocity fields (first panel), velocity dispersion maps 
(second panel), and emission line maps (third panel) for diff'erent stellar 
masses (from IM*, first column, to lOM*, last column). Lower mass sys- 
tems are not detected. The size of the stamps is, from left to right: 0.71, 1.35, 
and 1.7 arcsec. On the rightest side, we have reproduced the input templates 
shown in Fig.[3]to ease the comparison with simulations, although the spa- 
tial scales are diff'erent. 



tify how well one can distinguish between a rotating disk and a 
merger in any situation. At first order, such a criterion is a func- 
tion of the number of spatial element of resolutions available, and 
of the surface brightness detection limit. Present methods to distin- 
guish between rotating disks and mergers are indeed l imited to spe- 
cific range of SNR and/or spatial resolution (see, e.g., 'Shapi ro et al.l 
I2OO8: Flores et al. 2006). Finding a method for analysing the veloc- 
ity fields of distant galaxies with a large range of spatial resolution 
and/or SNR in a uniform way will be challenging and is far beyond 
the scope of this study. 

Here, we adopted a simple criterion based on a threshold on 
the spatial-mean SNR over the intrinsic galaxy diameter, as defined 
in Sect. 2.3. Using MOAO, it has been suggested that a spatial- 
mean SNR of 5 is a minimum to recover large-scale motions and 
distin guish between a major merger and a rotating disk JPuech et aP 
l2008bi) . It is clear that such a criterion is generally too simplistic to 
allow us to distinguish between a merger and rotating disk in any 
possib le case encountered in nature (see discussion in lPuech et al.l 
However, it captures the two required basic dependencies 
on surface brightness detection (i.e., only pixels having a kinematic 
SNRkin larger than 3 are considered for the kinematic analysis, see 
Sect. 2.3) and on the number of available element of spatial res- 
olutions (i.e., the spatial-mean SNR is derived over the intrinsic 
galaxy diameter independent of what parts are detected, see Sect. 
2.3). Observations of z~0.6 galaxies using FLAMES/GIRAFFE at 
the VLT have demonstrated that only 3 spatial element of reso- 
lution can already allow us t o distinguish between a rotating dis k 
and more complex systems dFlores et "^1 120061 : lYang et al.|[2008h . 
We checked that in 98% of the simulations, a spatial-mean SNR of 
at least 5 corresponds to at least 3 spatial elements of resolution. 
Therefore, this simple criterion provides us with a useful guidance 
on data quality for kinematic classification, well within the scope 
of the present study. 

In Table [5] we give the SNR obtained for a range of stellar- 
mass using the reference case and MOAO or GLAO corrections. 
According to this table, it should be feasible to distinguish between 
both kinds of templates down to 0.5M* at z=4 using MOAO. A 
direct visual inspection of the simulations confirms this threshold 
(see Fig. [To]). 



Using GLAO, Table[5] suggests that the same distinction could 
be made down to M* at z=4 (instead of 0.5M* using MOAO). Since 
GLAO corrections smooth out many more kinematic details than 
MOAO, it is difficult to distinguish a major merger from a rotating 
disk down to the same limit in stellar mass/SNR: it is indeed diffi- 
cult to visually distinguish non-circular motions in the 0.5M* major 
merger simulations at z=4 with GLAO, as shown in Fig.fTOl 

At z=5.6, the much higher background makes any distinction 
very difficult, and at best limits this exercise to the highest mass 
galaxies, as suggested by Tab. [5] and Fig.fTTI 

In summary, provided a SNR of 5, these simulations suggest 
that it is possible, using MOAO, to recover large scale motions of 
galaxies and distinguish between different dynamical states at least 
up to z-4 and down to Msteiiar-0.5M^. GLAO appears to provide 
very similar performances, although at lower spatial resolution, 
which would limit the recovery of the dynamical state down to ~M* 
at z-4. At higher redshift, the loss of SNR induced by the increasing 
contribution of the thermal background of the telescope, will limit 
the recovery of large-scale motions to very massive systems. 

4. 2. 3 DRM STEP 3: Rotation Curves 

In Figure[T2l we show rotation curves extracted from simulations at 
z=2 and z=4 using the UGC5253 rotating disk template. This tem- 
plate has the steepest velocity curve gradient among the morpho- 
kinematic templates, and is therefore useful to assess at which ex- 
tent which parts of the rotation curve can be recovered safely. In- 
deed, the central part of the rotation curve is well-known to be 
strongly affected by limited spatial resolution, an effect known for 
long as "beam smearing" by HI observers. Among the dynamical 
paramete rs fitted to derive a rotation curve from a velocity field 
(see, e.g., lEpinatetalJ2008l for details), inclination is by far the one 
that carries the largest uncertainty. Indeed, there is a well known 
degener acy between inclin ation and rotation velocity during such a 
fitting. lEpinat et all (l2009bh have extensively studied these degen- 
eracies as a function of beam smearing using artificially redshifted 
galaxies at z=1.7. These authors concluded that the best strategy is 
to use high-resolution broad-band imaging to derive a morphologi- 





Figure 10. Results of simulations at z=4 with MOAO for the reference case (upper panels; the size of the stamps is, from left to right, 0.55, 0.95, 1.2, 2.1, 
and 2.7 arcsec for the rotating disk template, and 0.4, 0.7, 0.9, 1.1, and 2.0 arcsec for the major merger template) and GLAO (bottom panels). Each panel 
shows (from top to bottom the velocity field (first sub-panels), the velocity dispersion map (second sub-panels), and the emission line map (third sub-panels) 
for diff'erent stellar masses (from O.IM*, first column, to lOM*, last column). The two panels on the left show simulations using a rotating disk template 
(UGC5253), while the two panels of the right show simulations for a major merger. On the leftest and rightest sides, we have reproduced the input templates 



The Physics and Mass Assembly of distant galaxies with the E-ELT 15 



cal estimate of the inclination, and use this to relax one parameter 
during the kinematic model fitting. We therefore did not try to fit 
the inclination, which was held constant during the velocity field, 
and focused instead on the influence of difl'erent AO modes in re- 
covering spatial features of the rotation curve. In other words, we 
assumed that high-resolution imaging will be available to provide 
the observer with unbiased estimated of this parameter. We adopted 
a simple arctan rotation curve m odel, which is fully described by 
two parameters (ICourteaulll997h . 

At z=2, only MOAO can more or less recover the rising part 
of the rotation curve (RC). Due to its much lower spatial resolu- 
tion (FWHMGLA0=161mas and FWHMMOAo^Hmas), GLAO in- 
duces a strong beam smearing-like efl'ect, as in HI observations 
of local galaxies. Even with MOAO, the spatial sampling of the 
IFU (50mas/pix) limits the spatial resolution of the observations 
(to ~0. 1 arcsec) and does not allow us to recover the true shape of 
the RC. Compared to the best RC that one can recover at this spatial 
scale (see the black dashed line), MOAO and even GLAO do not 
induce any bias in terms of rotation velocity. However, numerical 
simulations will be needed to recover the true rotation velocity, as 
it is already the case in lower redshift kinematic studies of distant 
galaxies (see , e.g., Flores et al. 2006; Forster-Schreiber et al. 2006; 
IPuech et al.ll2008l) . 

At z=4, the lack of spatial resolution will limit the study of 
rotation curves to super-M* rotating disks, as shown in Fig. [12] At 
these redshifts, only MOAO will provide enough spatial resolution 
to limit biases in recovering the rotation velocity or the rising part 
of the RC. 

In summary, the recovery of the whole shape of the RC will 
propably be limited to z^2, using MOAO. The recovery of the true 
rotation velocity will require numerical simulations, which is al- 
ready the case in lower-z studies. At higher redshifts, such measure- 
ments will probably be limited to super-M^ galaxies. Using GLAO 
strongly degrades the shape of the recovered RC and probably pre- 
vents any accurate recovery of its true shape. 

4. 2. 4 DRM STEP 4: Detailed kinematics 

To illustrate the capability of the E-ELT in recovering the detailed 
kinematics of distant galaxies, we consider in this section the re- 
sult of simulations with the Jeans-unstable clumpy disk templates 
shown in Fig. [13] Clumps can clearly be distinguished in galax- 
ies more massive than M* at z=4, using MOAO (see emission line 
maps). Using GLAO does not allow us to identify these clumps 
anymore: even in the most massive galaxy simulated, all clumps 
are smoothed together in the emission line map. One can still re- 
cover non-circular motions in the velocity field, but without clear 
morphological signature, it is difficult to identify the underlying 
cause of this perturbation. At higher redshift, the limited SNR does 
not allow us to recover such clumps anymore, even in the most 
massive case. 

In summary, these simulations suggest that it is possible to 
recover clumps in rotating disks down to z-4, galaxies using 
MOAO. Using GLAO probably prevent any direct identification of 
clumps in very distant galaxies. 



4.3 Compliance with figures of merit 

The goal of this simulated experiment was to study a large number 
Ngai of galaxies (in order to allow one to derive statistics over the 
morpho-kinematic types within each mass and redshift bin) at 2 < 
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Total 


z=2 
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0.8 


0.3 


2.3 


z=4 


2.3 


1.4 


0.6 


4.3 


z=5.6 




66.9 


16.39 


83.2 


Total 


3.5 


69.1 


17.2 


89.8 



Table 6. Integration time needed to complete the survey, in nights, assuming 
a multiplex M > Ngai/bin- 

z<6 with 0.1 < M,teiiar <5x IO^^Mq in less than 100 nights. We 
adopt the following assumptions as a typical observational strategy 
for this program: 

• Reference case with R=5000 and Api^^SOmsLS (see Sect. 3); 

• MOAO corrections; 

• Mauna-Kea-like background, as described in Sect. 3; 

• A limiting SNR of 10 (10-cr detection); 

• Overheads of 30% (i.e., overhead factor 0H=\.3); 

• The number of effective observed hours per night is assumed 
to be 8 hr; 

• Three redshift "bins" (z=2, z=4, and z=5.6) and three mass 
"bins" per redshift bin (~0.5M*,~ M*, and ~5M*), except for 
the z = 5.6 case, which has only two mass bins (~ M*, and 
~5M*), since we are interested in galaxies having 0.1 < Msteiiar ^ 
5 X lO^^Mo and that M,(z=5.6)=O.8x1O1OM0. We therefore assume 
that the survey is divided into Nbins-^ elementary bins; 

• The number of galaxies per elementary bin is assumed to be 
N gal /bin- This translates into total number of galaxies in the survey 

of Ngal/binXNbins- 

• Within each bin of the survey, the Ngai/bin galaxies can be ob- 
served in N setups setups, with N setup s-NgaiiunlM where M is the mul- 
tiplex advantage of the instrument, i.e., the number of galaxies that 
can be observed at the same time. We assume M ^ Ngai/bin, which 
means that all galaxies in a given bin can be observed using only 
one instrument setup (N setup = !)• 

Under these assumptions, the corresponding integration times, 
in nights, needed in each elementary bin are given in Tab. [6] The 
great collecting power of the E-ELT is clearly reflected in the time 
needed to survey the redshift range 2-4, which requires only ~7 
nights. Regarding the z=5.6 bin, the large thermal background of 
the telescope (see Figs. |7] and |9l) translates into very large integra- 
tion times. 

The total time of the survey, in nights, can be expressed as 
follows: 




As an example, if one wants to construct a 90-nights survey 
with a very large sample of Ngai^lOOO galaxies, a multiplex fac- 
tor of M=125 will be required. However, it is likely that the future 
multi-object integral field spectrograph on the E-ELT, such as EA- 
GLE ( Cub v et al.ll2008 ). will not provide us with such a large mul- 
tiplex capability. At constant T survey, the relation between the mul- 
tiplex capability M and the total number of galaxies in the survey 
is Ngai = 8M. Therefore, if one still requires such a large number 
of galaxies with a lower multiplex factor, e.g., M=25, then a total 
observation time of 5x90=450 nights will be necessary to com- 
plete the survey. Alternatively, one can decide to reduce the num- 
ber of galaxies: with M=25, 200 galaxies can be observed within 
Tsurvey^^^^- Such a program would certainly provide a very inter- 
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Figure 12. Rotation curve of a z=2, M* galaxy (left), and a z=4, 5M* galaxy (right). The black lines represent the original z=0 rotation curve rescaled to the 
size of simulated z=2 and z=4 galaxies. The rotation curve on each side of the galaxy is shown, along with an arctan fit. The rotation curve obtained using 
MOAO is shown in blue, while the one obtained using GLAO is shown in red. Error-bars represent only the uncertainty associated to the two parameters of 
the arctan function, and do not take into account uncertainties associated to other kinematic parameters like the dynamical centre or the Principle Axis. The 
black dashed line represent the rotation curve obtained in simulations without PSF: they correspond to the best rotation curve that one can obtain at the pixel 
scale of the simulations (50mas/pix). 
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Figure 13. From left to right: Simulations of clumpy disks at z=4 using MOAO (from O.IM*, first column, to lOM*, last column; the size of the stamps is, 
from left to right: 0.7, 1.25, 1.6, 2.8, 3.55 arcsec), GLAO at z=4 (lOM*; the size of the stamp is 3.55 arcsec), and MOAO at z=5.6 (lOM*; the size of the stamp 
is 3.0 arcsec). For each column, the first line shows the velocity fields, the second line the velocity dispersion maps, and the third line the emission line maps. 
On the rightest side, we have reproduced the input templates shown in Fig.|4]to ease the comparison with simulations, although the spatial scales are different. 



esting and useful first glimpse of the galaxy mass assembly process 
as a function of time. 

In summary, with a total integration time of 90 nights, and 
within the assumptions listed above, it will be possible to: 

• Observe Ngai galaxies with SNR-10 in three redshift bins (2-4- 
5.6) and three mass bins (0.5M^-1.0M*-5M*), except for the z = 5.6 
bin, which has two mass bins (~ M*, and ^5M^); all bins have 



M galaxies (i.e., the multiplex gain of the instrument), which will 
allow one to do statistics over morpho-kinematic types; 

• Recover the large-scale motions of galaxies at least up to z-4 
and down to Msteiiar - 0.5M*, i.e., of most galaxies in the survey; 

• Recover the detailed kinematics of the most massive galaxies 
( the detection of clumps in rotating disks will be possible down to 
z-4, M* galaxies). 
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5 DISCUSSION 

5.1 Limitations and accuracy of simulations 

5.1.1 Observational inputs 

For simplicity, we adopted several assumptions that might impact 
the results of the simulations in terms of SNR (see Sect. 3). First, 
we adopted a conservative global throughput, while proposed de- 
signed (e.g., EAGLE) predict throughput higher by a factor ~55%. 
This would translate into an increase of the SNR by ~25%, since 

SNR OC ^t trans- 

Second, we have neglected the central obscura- 
tion of the E-ELT, which leads to overestimate the SNR by ~8%, 
since SNR oc D. Finally, we have assumed that the [Oil] doublet 
in a single emis sion line instead o f a doublet. The average [Oil] 
line ratio is 1.4 (iFlores et al.ll2006h . which leads to an overestima- 
tion of the maximal SNR in the emission line by a 41% at constant 
flux (based on the crude assumption of two resolved Gaussians), 
which in turn leads to overestimate the spatial-mean SNR by the 
same factor. All these assumptions lead to overestimate the spatial 
mean SNR by ~24%, or, equivalently, the simulations correspond 
to an EWq ~37A instead of 30A, which was chose n as an extrap- 
olatio n of the mean EWodOll]) observed at z~l jHammer et al.l 
[1993). This new value is actually clos er to the median EWo([Oll]) 
observed at z~l jHammer et al .1 1 1 9971) : such a value is as relevant 
as the first one, and therefore we conclude that the adopted simpli- 
fications do not impact strongly the results. 

The most important limiting factors related to the observa- 
tional parameter space are certainly the emissivity of the telescope, 
which drives the thermal emission from the telescope, which limits 
the detectability of high-z galaxies in the K band, and the brightness 
of the sky continuum between OH lines, which limits observations 
at lower redshift (see Sect. 4.2). As noted in Sect. 3.2, the sky con- 
tinuum could indeed be two times brighter than assumed, based on 
a comparison between Mauna Kea and Paranal sites. This would di- 
rectly impact the achieved SNR by a factor V2, e.g., a mean SNR 
of 10 would translates into a mean SNR of 7.1. On the other hand, 
an emissivity of 15% (pure Al coating) would imply a three times 
higher thermal background, reducing the SNR even further, by a 
factor V3: a mean SNR of 10 would translate into a mean SNR of 
5.8. These diff'erent assumptions directly impact the achieved SNR 
at a given redshift and mass. For instance, this would lower the red- 
shift up to which the GSMF can be probed down to Msteiiar, from 
z=4.1 down to z~3.8-3.9. Therefore, while this could impact ob- 
servations on a galaxy-by-galaxy basis, it would not impact them 
strongly in a statistical sense. 

One of the current limitation of the simulations is that most of 
inputs are currently considered as invariant with wavelength (e.g., 
throughput, emissivity), because the design of the telescope and 
the instrument are not fully known at the moment. Simulations will 
need to be updated once these characteristic curves will be known. 
However, we expect this to be a second order eff'ect, as character- 
istic curves are known to vary only very specific small windows 
(e.g., absorption features in optics transmission curves). 

Finally, it is worth noting that we assumed a photon- starved 
sky subtraction: sky and science frames were simulated using a dif- 
ferent noise realisation, which generated random errors only. This 
choice was driven by the fact that, in the NIR, galaxies are usually 
selected in redshift such that their emission lines fall in spectral 
windows free of OH sky lines. In this case, the sky subtraction pro- 
cess can reasonably well be assumed to be limited by the photon 
noise on the sky continuum. This assumption implies that sky sub- 
traction is not aff'ected by systematic errors. Further simulations 



related to the data reduction strategy will be required to assess the 
impact of such biases (aff'ecting, e.g., measurements and/or estima- 
tion of sky frames). 

5.1.2 Scientific inputs 

In reality, the morpho-kinematic range of templates is significantly 
broader than the one used for the simulations. Because our knowl- 
edge of galaxy formation is still incomplete, the limited number 
of templates was chosen in order to cover a wide range of sur- 
face brightnesses. We have intentionally focused on a few repre- 
sentative cases, ranging from major mergers with complex morpho- 
kinematics to irregular galaxies with flat velocity fields, and regular 
rotating disks with difl'erent velocity gradients. Despite their lim- 
ited number, these templates should span most interested cases. 

The impact of the difl'erent surface brightness distributions can 
be seen in the middle panel of Fig. |9] where error-bars reflect the 
range of mass that can be reached at a given redshift and S/N for the 
10 difl'erent morpho-kinematic templates considered in this work. 
We find a tendency for the most irregular surface brightness distri- 
butions to result in a largest mass limit at a given total flux. Such 
surface brightness distributions result in a more complex coupling 
between adjacent spectra in the IFS at limited spatial resolution, 
which in turn results in emission lines with a lower peak and larger 
wings compared to more regular distributions in surface brightness. 

Results from high resolution imaging and spatially-resolved 
spectroscopy have revealed that high-z galaxies have very dif- 
ferent properties in their morpho logy and kinematics, com- 
pared to l ocal galaxies (e.g., Elmegr een & ElmegreenI |2003; 
iForster Sch reiber et al. 2009). In particular, these observations sug- 
gest that a significa nt fraction of high -z galaxies would be rotating 
clumpy disks fe.g.. iGenzel et al.ll2008i) . We find that at z~2, the 
mass limit of clumpy disks is roughly one order of magnitude larger 
than more regular galaxies. Transposing this result on current 3D 
samples observed with 8- 10m telescopes, this could explain why 
very distant galaxies detected by 3D spectroscopy so far, which 
are l argely dominated by irregular surface brightness distributions 
fe.g. jForster Schreiber et al.ll2009h . systematically show very large 
equivalent widths in their emission lines to be detected. If a signifi- 
cant fraction high-z galaxies are indeed clumpy disks, then the true 
average SNR at a given redshift and mass might be biases toward 
the largest values in the range shown in Fig.|9] 

Most of physical quantities used as inputs are not known to 
a factor better than two. We first discuss the source of uncertainty 
associated with the intrinsic scatter of the scaling relations used to 
rescale the morpho-kinematic templates in terms of flux and size. 
The empirical relations between stellar mass and K-band magni- 
tude as a function of redshift have typical uncertainties of 0.3 dex 
(i.e., 0.75 mag, but likely more at z^4, see Sect. 3.1). This uncer- 
tainty on the stellar mass directly translates into a 0. 1 dex uncer- 
tainty on log Rhaif, using the scaling relation between stellar mass 
and half-light radius Rhaif adopted in Sect. 3.1. This is to be com- 
pared to the intrinsic scatter of this relation (0.3 dex in log Rhaif, 
see ICourteau et al.ll2007h . to the scatter of the size versus w ave- 
length relation (0.04 dex in log Rhaif, see lBarden et al.ll2005l) . and 
to that of the size versus redshift relation (-0 06 dex in los, Rhair , 
see lBouwens et al.ll2004 iFerguson et "aDl2004lDahlen et al.l2607n . 
The two dominant sources of uncertainty are clearly the redshift 
vs. stellar mass / K-band magnitude relation and the scatter of the 
stellar-mass vs. size relation, with roughly an uncertainty of a factor 
two in size and total flux. 

Another source of uncertainty is the possible evolution of 
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these relations with redshift. While the stellar mass vs. size relatio n 
does not appear to evolve, at least up to z~ 1 tearden et alJl2005h . 
the Tully-Fisher relation appears to evolve smoothly with redshift 
(a factor two in mass betw een z=0 and z-0.6 , see Puech et al. 2008, 
slightly more at z~2, see ICresci et Zl lioogh . In this paper, we as- 
sumed no evolution of this relation: this is conservative as, at a 
given stellar mass, high-z rotating disks would have larger rotation 
velocities, which would make the associated velocity gradients eas- 
ier to be detected. The scatter associated to the stellar mass Tully- 
Fisher Relation is ~0.15 dex in log M^teiiar^ meaning that it will not 
introduce significant additional uncertainty on the stellar mass / K- 
band magnitude compared to the stellar mass vs. size relation (see 
above). 

Finally, one of the less constrained physical parameter is per- 
haps the emission line equivalent width in very distant galaxies. 
This parameter does not influence the flux distribution but only its 
total intensity. We have adopted the median value found for [Oil] at 
z~l (see Sect. 3.1 and 5.1.1), with an associated scatt er of 15-20A, 
depending on sample selection ( Hammer et al .1 1 1 997b . L arger val- 
ues were observed at larger redshift, especially with Ka (lErb et al.l 
[2006). It is indeed expected that the typical equivalent widths in- 
crease with redshift, as a result of the increas e of the star for ma- 
tion density with redshift, at least up to z~2 (lHopkinsll2004h . In 
this paper, we chose not to parametrise emission line equivalent 
width as a function of redshift. In practice, such a parametrisation 
would be largely aff'ected by the sample selection criteria at very 
high-z, and the physical processes driving and aff'ecting emission 
within diff'erent lines, as one has to switch from Ha to [Oil] above 
z~4. We therefore decided not to introduce any additional uncer- 
tain parameter and adopted the observed [Oil] median value at z=l 
as a conservative value. Moreover, adopting a constant value helps 
to disentangle the diff'erent sources of surface brightness evolution 
with redshift, which are only due to mass and size evolution in this 
study. The results given in Sect. 4.2 are to this respect conservative, 
especially between z~2 and 4, where observations are not strongly 
limited by the thermal emission from the telescope: considering an 
equivalent wi dth of lOOA in Hof, as it is often observed in z~2 
galaxies fe.g. jForster Schreiber et al.l2009l) . instead of 30A, would 
imply an increase by a factor 3 of the mean SNR (see Sect. 4.2). 

We therefore conclude that simulations are internally- 
consistent within a factor two in all the physical quantities con- 
sidered. For instance, the empirical relations between stellar mass 
and K-band magnitude are uncertain within a factor two, but the 
flux correction between the rest-frame wavelength corresponding 
to the K-band and that of the emission line considered is also less 
than a factor two (see Sect. 3.1). 

Finally, in this DRM program, we assumed that targets will 
be selected from future deep and complete optical-NIR imaging 
and redshift surveys. Such surveys could potentially be done us- 
ing existing facilities such as VISTA or the LSST, but will prob- 
ably also benefit from the follow-up of future instruments such as 
JWST/NIRSPEC or ALMA. To guarantee the representativity of 
the sample and optimise the sampling of the look-back time axis, 
we did not assume any pre- selection in colour, nor do we pre- 
select redshifts in terms of atmospheric transmission as it is cur- 
rently done on 8- 10m telescopes. This allows us to take advantage 
of the huge collecting power of the E-ELT, and target any poten- 
tial galaxy regardless of its redshift. As an alternative observational 
strategy, one could select more optimal redshift windows in corre- 
spondence of atmospheric windows, which would directly result in 
a gain of SNR and survey speed. Looking at Fig. 6, optimal win- 
dows are around 1.6 yum (which corresponds to z~3.3 using [Oil]), 



and 2.2 jim (which corresponds to z~2.3 using Ha or 4.9 using 
[Oil]. Selecting galaxies in these windows would guarantee an at- 
mospheric transmission ~95% (provided that targets are selected 
between strong absorption features), i.e., roughly a factor two larger 
than the atmospheric transmission corresponding to simulations at 
z=2, with 44%, and at z=4, with 63%: for instance, targeting galax- 
ies at z~2.3 instead of z=2, would result in a gain of 2.2 in SNR. 
Such a factor is well below the inherent uncertainty associated to 
the simulations (see above). 

5.2 Impact of telescope design and size 

The telescope is by far the dominant source of background in the K- 
band. Because the SNR is in a background-limited regime at z=5.6 
(see Fig. [9]), the telescope emission limits source detection at very 
high redshift. Note that in the simulations, we have used an opti- 
mistic assumption for the emissivity of the 5-mirror E-ELT design, 
with Etei =59^1. 

MO AO provides only partial seeing corrections, therefore the 
resulting PSF is dom inated by residual atmospheric perturbations 
(lAssemat et al.l2007b . This implies that the telescope diameter does 
not directly influence the spatial resolution of observations, but 
only the achieved SNR. The scaling relation of Sect. 4.2 shows that 
there is no breaking point in telescope diameter: with a smaller 30m 
telescope, one would need two times longer exposures to reach the 
same SNR. 

In terms of survey speed, the E-ELT represents a huge im- 
provement compared to existing 3D spectrograph on 8- 10m tele- 
scopes. The fiducial program depicted in Sect. 4.3 will deliver 75 
galaxies at z~2 covering the whole mass spectrum in only 2.3 
nights. It is wo rth comparing this number wi th, e.g., the SINS sur- 
vey at the VLT. lForster Schreiber et al .1 ( 1200 9^ described this survey 
of 63 galaxies and gave an average integration time of 3.4 hr per 
galaxy (per band), which represent about 27 VLT nights in total. 
Note that this does not account for observing time lost because of 
undetected targets. The E-ELT will allow us to observe roughly the 
same number of galaxies, but saving more than a factor 10 in time. 
In addition, one should consider that SINS galaxies have typically 
an emission line equivalent width of ~100A, while the simulations 
assume 30A. Rescaling simulations with a gain of 3 in equivalent 
width leads to a gain of 9 in integration time. It basically means that 
the SINS survey could be completed in less that one night with the 
E-ELT. Moreover, the E-ELT will enable a better control of the se- 
lection function: because of the limited sensitivity of current 8- 10m 
telescopes, pre-selection in colour must be used to draw 3D distant 
samples beyond z~l (see Sect. 2.1). In comparison, the E-ELT will 
allow us to conduct a very complete 3D follow-up of a pure mass- 
selected sample. Sect. 4.2.1 demonstrates that the E-ELT will allow 
us to draw such samples up to z~4-5. Such a limit will clearly re- 
main out of reach of current 8- 10m telescopes, except for a handful 
of targets. 

5.3 Impact of site properties 

The sky is the dominant source of background only for z < 4 ob- 
servations. Note that in the simulations we have used a sky model 
from Mauna Kea, which is ~2 times fainter in the H band that 
the official DRM Paranal model (see Sect. 3.2 and Fig. [6]). As 
SNR oc \/ ^J background in such a regime, a site having two (four) 

^ see |http://www.eso.org/sci/facilities/eelt/science/drm/tech_data/telescope/| 
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times higher background than the one used in the simulations will 
reduce the achieved SNR from, e.g., 10 to 7 (5). The impact on 
high-redshift observations is therefore relatively limited (see Fig. 
ID). 

The SNR loss between observations with a seeing of 0.8 arc- 
sec and observations with a seeing of 0.95 arcsec is 5-15%. The 
strongest impact of worst seeing conditions will be to limit the ca- 
pability to recover Rotation Curves and detailed kinematics of dis- 
tant galaxies. 

5.4 Impact of Instrument 

GLAO limits the achievable SNR to lower values, compared with 
MO AO. As a consequence, the recovery of the large scale mo- 
tions in distant galaxies using GLAO is limited to higher masses. 
Any further objective of the DRM will be strongly limited in using 
GLAO instead of MOAO (e.g., rotation curves, detailed kinemat- 
ics). 

Appropriate targets in the NIR are usually selected such as 
they have emission lines that fall in regions free of strong OH lines. 
This requires a minimal spectral resolution of R~3000 to resolve 
OH sky lines with enough accuracy. On the other hand, the high- 
est the spectral resolution, the better the accuracy on the recovered 
kinematics (see Tab.|7]): at least R=5000 is required if one wants to 
recover the velocity dispersion with no more that 50% of relative 
uncertainty. This value appears to be a good compromise between 
the desire to minimise the impact of the OH sky lines and not want- 
ing to over-resolve the line by a large factor: the scaling relation 
between SNR and R (see Sect. 4.2) demonstrates the interest of 
having the smallest spectral resolution possible, which optimises 
the achieved SNR. 

The choice of the IFU pixel scale drives the spatial resolu- 
tion of MOAO-fed 3D spectroscopy observations, as in the range 
of EE provided by MOAO, the PSFs have a FWHM largely smaller 
than twice the pixel size. Therefore, the choice of the optimal IFU 
pixel scale is related to the optimal "scale-coupling" between the 
IFU pixel scale and the spatial sc ale of the physical fe ature that one 
wants to recover using this IFU jPuech et al.ll2008bb . This can be 
quantified by using the ratio between the size of this feature (here, 
the galaxy diameter, as one wants to recover the large-scale rota- 
tion) and the size of the IFU resolution element. 3D observations of 
z~0.6 galaxies with FLAMES/GIRAFFE have demonstrated that 
a scale-coupling of at least 3 can alre ady provide us w ith use- 
ful information on large-scale motions jFlores et al.l l2006\ allow- 
ing us to distinguish rota t ing disks from other more complex sys- 
tems jPuech et al.ll2006al : lYang et al.ll2008h . lEpinat et all (l2009bh 
showed that such a scale-coupling is enough to provide us with 
unbiased estimated of the PA and rotation velocity in distant galax- 
ies. It corresponds to the minimum value necessary to ensure that 
each side of the galaxy is at least spatially sampled by the IFU at 
the Nyquist rate. Of course, there are specific situations where a 
finer spatial sampling is desirable in order to properly distinguish 
between complex systems and rotating disks (see, e.g., Epinat et al. 
[2009bl) . but using this as a guide-line provide us with a useful upper 
limit on the IFU spatial sampling, which is optimised for very high 
surface brightness detection. In the z-A simulations, this minimal 
scale-coupling leads to a pixel scale of 55mas, 125 mas, and 280 
mas for 0.1, 1, and 10 M* galaxies, respectively. Hence, a minimal 
pixel scale of 50 mas is required if one wants to be able to recover, 
at least in principle, large-scales motions in z-A galaxies, and for a 
large range of stellar-mass. 

Figure [TOl shows the velocity fields, velocity dispersion maps. 
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Table 7. Accuracy on the velocity (in km/s, first lines) and relative accuracy 
(in %, second lines) on the velocity dispersion measurement as a function 
of spectral resolution (Monte-Carlo simulations). 



and emission line maps extracted from the simulations using a 50 
mas pixel scale. It is clear that the lowest mass case barely pro- 
vides enough spatial information to clearly distinguish between the 
rotating disk and the major merger. Even if the scale-coupling is 
in principle large enough to properly recover large-scale motions, 
the pixel scale limits the achieved SNR to relatively low values (see 
Tab.O. If one only wants to recover large-scale motions in the most 
massive objects, then an IFU pixel scale of 75 mas can be used, 
provid ing more SNR at constant integration time fsee lPuech et al.l 
l2008bh . 

Finally, we adopted 50 mas as a reference pixel scale in the 
simulations. This resulted from a compromise between resolving 
sub-kpc scales (100 mas represent 0.7 kpc at z=4) and the result- 
ing surface brightness sensitivity. We derived the emission line flux 
sensitivity per pixel in the following way. We considered only sim- 
ulations with 50 mas per pixel and R=5000 (the result can be eas- 
ily rescaled to any other couple of spatial and spectral sampling). 
Then, we constructed the histogram of the emission line flux corre- 
sponding to a threshold of S NRun larger than three (which defines 
the spaxels used for analysing the VF, see Sect. 2.1). In this his- 
togram, we considered all simulations with integration times of 24 
hr, but with diff'erent seeing, AO correction and morpho-kinematic 
templates, which allow us to average over a large number of diff'er- 
ent observational conditions. We found that this histogram peaks at 
2xl0"2^erg/s/cm^ 

In the simulations, we also explored a 25 mas pixel scale, 
which allowed us to bracket the EAGLE baseline with 37.5 mas. 
Compared to our reference case, the EAGLE baseline (R=4000 
and 37.5mas/pix) would imply a 16% decrease in SNR per spa- 
tial and spectral element of sampling (see Sect. 4.2.1), and de- 
grade the emission line flux sensitivity limit by a factor 2.2. 
However, the EAGLE pixel scale provides an optimal scale- 
cou pling with distant clumps that are ubiquitous in distant galax- 
ies telmegreen & Elmegr eenI [2005b : such clumps are typically 1 
kpc is diameter, therefore such a pixel scale would provide us with 
a scale-coupling of 3-4 between z=2 and z=4, which is an opti- 
mal con ipromise for studyin g the structure and kinematics of these 
clumps dPuech etal.f2009bh . The SNR per pixel goes linearly with 
the pixel scale: adopting a 4 mas pixel scale, which corresponds to 
the diff'raction limit in H band, would imply a loss of 92% in SNR 
per pixel compared the 50 mas pixel scale. This has some bearing 
on detailed studies of relatively nearby galaxies (and in particular 
of their central AGN) for which the E-ELT will provide a huge 
improvement compared to, e.g., the VLT (see Sect. 5.2), as it will 
allow us to probe finer spatial scales (8 mas represents 0.07 kpc 
at z=2). However, such a finner spatial sampling is much less effi- 
cient in terms of survey speed for observing higher redshift galax- 
ies. Moreover, the structure of distant galaxies at such small spatial 
scales remains uncertain. 
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6 CONCLUSIONS 

We have conducted simulations of the "Physics and mass assem- 
bly of galaxies out of z~6" science case for the E-ELT, exploring 
a wide range of observational and physical parameters. We have 
defined figures of merit for this science case despite the inherent 
complexity of the science goals, derived empirical scaling relations 
between the signal-to-noise ratio of kinematic and intensity maps 
and the main telescope and instrument parameters, as well as a rela- 
tion between the limit in stellar mass that can be reached for a given 
signal-to-noise ratio as a function of redshift. We specifically inves- 
tigated the impact of AO performance on the science goal. We did 
not identify any breaking points with respect to all parameters (e.g., 
the telescope diameter), with the exception of the telescope thermal 
background, which strongly limits the performance in the highest 
(z>5) redshift bin. We find that the full range of science goals can 
be achieved with a ~100 nights program on the E-ELT, provided 
a high multiplex advantage M ~ Ngai/^. We stress that several as- 
sumptions and guided guesses had to be made on both the obser- 
vational conditions and physical characteristics of distant galaxies 
under study. This introduces an inherent uncertainty, which can be 
mitigated with future simulations as the telescope design will be 
consolidated and more details about the physics of high-z galaxies 
will become available. 
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